Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-Based Reranking

نویسندگان

چکیده

The goal of local citation recommendation is to recommend a missing reference from the context and optionally also global context. To balance tradeoff between speed accuracy in large-scale paper database, viable approach first prefetch limited number relevant documents using efficient ranking methods then perform fine-grained reranking more sophisticated models. In that vein, BM25 has been found be tough-to-beat prefetching, which why recent work focused mainly on step. Even so, we explore prefetching with nearest neighbor search among text embeddings constructed by hierarchical attention network. When coupled SciBERT reranker fine-tuned tasks, our Attention encoder (HAtten) achieves high recall for given candidates reranked. Consequently, requires fewer rerank, yet still state-of-the-art performance various datasets such as ACL-200, FullTextPeerRead, RefSeer, arXiv.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Incremental Reranking for Hierarchical Text Classification

The top-down method is efficient and commonly used in hierarchical text classification. Its main drawback is the error propagation from the higher to the lower nodes. To address this issue we propose an efficient incremental reranking model of the top-down classifier decisions. We build a multiclassifier for each hierarchy node, constituted by the latter and its children. Then we generate sever...

متن کامل

Content-Based Citation Recommendation

We present a content-based method for recommending citations in an academic paper draft. We embed a given query document into a vector space, then use its nearest neighbors as candidates, and rerank the candidates using a discriminative model trained to distinguish between observed and unobserved citations. Unlike previous work, our method does not require metadata such as author names which ca...

متن کامل

A Hierarchical Contextual Attention-based GRU Network for Sequential Recommendation

Sequential recommendation is one of fundamental tasks for Web applications. Previous methods are mostly based on Markov chains with a strong Markov assumption. Recently, recurrent neural networks (RNNs) are getting more and more popular and has demonstrated its effectiveness in many tasks. The last hidden state is usually applied as the sequence’s representation to make recommendation. Benefit ...

متن کامل

Large-scale Structural Reranking for Hierarchical Text Categorization

Current hierarchical text categorization (HTC) methods mainly fall into three directions: (1) Flat one-vs.-all approach, which flattens the hierarchy into independent nodes and trains a binary one-vs.-all classifier for each node. (2) Top-down method, which uses the hierarchical structure to decompose the entire problem into a set of smaller subproblems, and deals with such sub-problems in top-...

متن کامل

Citation Recommendation via Proximity Full-Text Citation Analysis and Supervised Topical Prior

Currently the many publications are now available electronically and online, which has had a significant effect, while brought several challenges. With the objective to enhance citation recommendation based on innovative text and graph mining algorithms along with full-text citation analysis, we utilized proximitybased citation contexts extracted from a large number of full-text publications, a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-030-99736-6_19